459 research outputs found
Recommended from our members
Architecture Optimizations for Memory Systems of Throughput Processors
Throughput-oriented processors, such as graphics processing units (GPUs), have been increasingly used to accelerate general purpose computing, including machine learning models that are being utilized in numerous disciplines. Thousands of concurrently running threads in a GPU demand a highly efficient memory subsystem for data supply in GPUs. In this dissertation, we have studied the memory architecture of the traditional GPUs and revealed that the traditional memory architecture, initially designed for graphics processing, is less efficient in handling general purpose computing tasks. We propose several memory architecture optimizations for two primary objectives: (1) optimize current memory architecture for more efficient handling of general purpose computing tasks; (2) improve the overall performance of GPUs.
This dissertation has four major parts: (1) The first part deals with the L2 cache inefficiency. A key factor that affects the memory subsystem is the order of memory accesses. While reordering memory accesses at L2 cache has large potential benefits to both cache and DRAM, little work has been conducted to exploit this. In this work, we investigate the largely unexplored opportunity of L2 cache access reordering. We propose Cache Access Reordering Tree (CART), a novel architecture that can improve memory subsystem efficiency by actively reordering memory accesses at L2 cache to be cache-friendly and DRAM-friendly. (2) The second part deals with miss handling architecture (MHA) in GPUs. Conventional MHA is static in sense that it provides a fixed number of MSHR entries to track primary misses, and a fixed number of slots within each entry to track secondary misses. This leads to severe entry or slot under-utilization and poor match to practical workloads, as the number of memory requests to different cache lines can vary significantly. We propose Dynamically Linked MSHR (DL-MSHR), a novel approach that dynamically forms MSHR entries from a pool of available slots. This approach can self-adapt to primary-miss-predominant applications by forming more entries with fewer slots, and self-adapt to secondary-miss-predominant applications by having fewer entries but more slots per entry. (3) The third part aims to improve the performance of Unified Virtual Memory (UVM), which is recently introduced into GPUs. We propose CAPTURE(Capacity-Aware Prefetch with True Usage Reflected Eviction), a novel microarchitecture scheme that implements coordinated prefetch-eviction for GPU UVM management. CAPTURE utilizes GPU memory status and memory access history to dynamically adjust the prefetching and ``capture'' accurate remaining page reusing opportunities for improved eviction. (4) In the fourth part, we propose a comprehensive UVM benchmark suite named UVMBench to facilitate future research on the UVM research
E-Beam Patterned Gold Nanodot Arrays on Optical Fiber Tips for Localized Surface Plasmon Resonance Biochemical Sensing
Electron beam lithography (EBL) was used to directly pattern periodic gold nanodot arrays on optical fiber tips. Localized surface plasmon resonance of the E-beam patterned gold nanodot arrays on optical fiber tips was utilized for biochemical sensing. The advantage of the optical fiber based localized surface plasmon resonance (LSPR) sensors is the convenience to work with and work in harsh environments. An optical fiber tip LSPR refractive index sensor of 196 nm per refractive index unit (RIU) sensitivity has been demonstrated. The affinity sensing property of the fiber tip sensor was demonstrated using biotin/streptavidin as the receptor/analyte. The detection limit for streptavidin was determined to be 6 pM
Rice Calcineurin B-Like Protein-Interacting Protein Kinase 31 (OsCIPK31) Is Involved in the Development of Panicle Apical Spikelets
Panicle apical abortion (PAA) causes severe yield losses in rice production, but details about its development and molecular basis remain elusive. Herein, a PAA mutant, paa1019, was identified among the progeny of an elite indica maintainer rice line Yixiang 1B (YXB) mutagenized population obtained using ethyl methyl sulfonate. The abortion rate of spikelets in paa1019 was observed up to 60%. Genetic mapping combined with Mutmap analysis revealed that LOC_Os03g20380 harbored a single-bp substitution (C to T) that altered its transcript length. This gene encodes calcineurin B-like protein-interacting protein kinase 31 (OsCIPK31) localized into the cytoplasm, and is preferentially expressed in transport tissues of rice. Complementation of paa1019 by transferring the open reading frame of LOC_Os03g20380 from YXB reversed the mutant phenotype, and conversely, gene editing by knocking out of OsCIPK31 in YXB results in PAA phenotype. Our results support that OsCIPK31 plays an important role in panicle development. We found that dysregulation is caused by the disruption of OsCIPK31 function due to excessive accumulation of ROS, which ultimately leads to cell death in rice panicle. OsCIPK31 and MAPK pathway might have a synergistic effect to lead ROS accumulation in response to stresses. Meanwhile the PAA distribution is related to IAA hormone accumulation in the panicle. Our study provides an understanding of the role of OsCIPK31 in panicle development by responding to various stresses and phytohormones
Status of the SXFEL Facility
The Shanghai soft X-ray Free-Electron Laser facility (SXFEL) is being developed in two steps; the SXFEL test facility (SXFEL-TF), and the SXFEL user facility (SXFEL-UF). The SXFEL-TF is a critical development step towards the construction a soft X-ray FEL user facility in China, and is under commissioning at the Shanghai Synchrotron Radiation Facility (SSRF) campus. The test facility is going to generate 8.8 nm FEL radiation using an 840 MeV electron linac passing through the two-stage cascaded HGHG-HGHG or EEHG-HGHG (high-gain harmonic generation, echo-enabled harmonic generation) scheme. The construction of the SXFEL-TF started at the end of 2014. Its accelerator tunnel and klystron gallery were ready for equipment installation in April 2016, and the installation of the SXFEL-TF linac and radiator undulators were completed by the end of 2016. In the meantime, the SXFEL-UF, with a designated wavelength in the water window region, began construction in November 2016. This was based on upgrading the linac energy to 1.5 GeV, and the building of a second undulator line and five experimental end-stations. Construction status and the future plans of the SXFEL are reported in this paper
4th International Symposium on Software Reliability, Industrial Safety, Cyber Security and Physical Protection of Nuclear Power Plant (ISNPP)
This book presents a compilation of selected papers from the Fourth International Symposium on Software Reliability, Industrial Safety, Cyber Security and Physical Protection of Nuclear Power Plant, held in August 2019 in Guiyang, China. The purpose of the symposium was to discuss inspection, testing, certification and research concerning the software and hardware of instrument and control (I&C) systems used at nuclear power plants (NPP), such as sensors, actuators and control systems. The event provides a venue for exchange among experts, scholars and nuclear power practitioners, as well as a platform for the combination of teaching and research at universities and enterprises to promote the safe development of nuclear power plants. Readers will find a wealth of valuable insights into achieving safer and more efficient instrumentation and control systems
- …